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Final  Report  on  AFOSR  Grant  F49620-92-J-0240 
Abstract  and  Project  Overview 

Abstract:  We  tested  the  new  LAN-capable  sensor/actuator  package  by  de¬ 
veloping  a  monitor  controller  for  the  Piranha  adaptive  parallelism  environment. 
Piranha  is  a  system  that  allows  processes  of  a  parallel  application  to  be  created 
dynamically  (for  example  on  newly  idle  LAN  nodes)  and  removed  dynamically 
(for  example,  when  an  owner  resumes  work  at  his  node)  while  the  computa¬ 
tion  as  a  whole  continues  without  interruption.  Overall  research  results  of  the 
AFOSR  Grant  #  F49620-92-J-0240  support  include  development  of  the  undcr- 
lying  Pirahna  system  for  adaptive  parallelism  to  manage  nodes  of  a  dedicated 
multiprocessor  efficiently,  specific  Network  Piranha  and  Multiprocessor  (CM-5) 
applications  and  implementations,  and  the  (ongoing)  development  of  a  practical 
software  application  called  ^^Lifestreams/' 

Project  Overview:  The  Piranha  systems  poses  a  number  of  monitoring 
and  control  problems:  it  requires  that  the  current  idle/busy  and  ^idleness  crite¬ 
ria”  status  of  all  nodes  be  maintained,  that  predictions  be  developed  with  respect 
to  likely  future  idleness  patterns  of  each  node  in  the  pool,  that  the  status  of 
all  Piranha  applications  and  their  behaviors  be  maintained  and  that  Piranha 
jobs  be  assigned  to  particular  idle  nodes,  among  other  issues.  The  problem  was 
a  good  test  bed  for  the  scnsor/actuator  LAN  trellis  because  it  required  actu¬ 
ator  and  not  just  sensor  capability,  It  was  inherently  distributed  and  required 
LAN  capacity,  and  it  was  inherently  a  significant,  interesting  problem.  The 
Piranha-trellis  we  developed  worked  sufficiently  well  to  suggest  that  the  entire 
Piranha  system  (and  not  jusst  monitor-control  functions)  might  be  structured 
as  a  trellis.  A  trellis-structured  wide  area  Piranha  system  being  developed. 

The  use  of  a  system  like  Trellis  to  monitor  and  control  a  wide-area  Piranha 
network  poses  one  obvious  difficulty.  Scheduling  decisions  are  centralized  in  the 
Trellis;  we  need  to  insure  that  propagation  delays  between  available  network 
nodes  and  the  Trellis’s  scheduling  modules  are  not  such  as  to  make  any  decisions 
of  a  centralized  scheduler  obsolete  by  the  time  they  are  made,  and  still  more 
obsolete  when  they  are  put  into  effect. 

The  hierarchical  design  of  the  Trellis  gives  us  a  framework  for  addressing 
these  problems.  Our  intention  is  to  insure  that  relatively  more  time-critical 
decisions  are  made  relatively  closer  to  the  nodes  to  which  they  are  relevant. 

Scheduling  decisions  that  might  entail  moving  processes  from  domain  M  to 
domain  N  are  made  in  general  by  the  lowest-ranking  Trellis  module  that  spans 
N  and  M;  such  modules  will  ‘^as  close  as  possible”  to  the  domains  they  monitor. 
The  fewer  domains  a  scheduling  module  monitors;  the  greater  the  likelihood  that 
the  module  can  be  placed  ‘‘relatively  close”  to  all  of  them— assuming  that  we 
use  network  geography  as  a  guide  when  we  group  domains  together-  The  higher- 
ranking  a  scheduling  module — the  more  domains  it  covers,  and  the  farther  it 
is  from  its  most-distant  domain — the  more  conservative  the  scheduling  module 
must  be  in  moving  processes  around  the  network. 
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Suppose  that  a  scheduling  module  is  responsible  for  deciding  whether  pr«^ 
cesses  should  be  moved  from  the  “main  campus”  to  the  “medical  s^ool,  and 
assume  that  those  domains  are  far  apart  in  terms  of  network  geography.  It  mus 
be  willing  to  guess  that  any  idle  nodes  serving  as  targets  for  new  pioces^  will 
be  idle  at  least  as  long  as  the  delay  involved  in  learning  about  their  stati^ 
and  moving  processes  to  them— otherwise  it  has  no  reason  to  expect  that  they 
wiU  still  be  idle  when  the  processes  arrive.  But  of  course  m  practice,  it  must 
be  more  conservative  than  that,  in  order  to  justify  the  overhead  incurred  by 
moving  processes  from  one  domain  to  a  distant  other  domain.  Sebe  u  ers  a 
increasingly  higher,  more  global  levels  must  have  increasingly  large  estimates 
for  likely  node  idle  times  in  hand  in  order  to  justify  moving  processes. 

In  sum,  the  TtelUs’s  mherently  hierarchical  structure  provide  what  seems 
like  exactly  the  right  framework  for  dealing  with  increasingly  global  Pir^ha 
scheduling  decisions. 

Research  Results  /  AFOSR  Grant  #  F49620.92-J-0240 


1)  Adaptive  Parallelism  and  Piranha  •  „  u 

Adaptive  parallelism  refers  to  parallel  computations  on  a  dynamically  chang¬ 
ing  set  of  processors:  processors  may  join  or  withdraw  from  the  computation  as 
it  proceeds.  Networks  of  fast  workstations  are  the  most  important  setting  for 
adaptive  parallelism  at  present.  Workstations  at  most  sites  are  typically  idle 
for  significant  fractions  of  the  day,  and  those  idle  cycles  may  constitute  in  the 
aggregate  a  powerful  computing  resource.  For  this  reason  and  others,  we  believe 
that  adaptive  parallelism  is  assured  of  playing  an  increasingly  prominent  role  m 
parallel  applications  development  over  the  next  decade. 

Under  adaptive  parallelism,  the  set  of  processors  executing  a  parallel  pr^ 
gram  may  grow  or  shrink  as  the  program  tuns.  Potential  gains  inclu  ®  ^  ® 
capacity  to  tun  a  parallel  program  on  the  idle  workstations  in  a  conventional 
LAN— processors  join  the  computation  when  they  become  idle,  and  withdraw 
when  their  owners  need  them-and  to  manage  the  nodes  of  a  dedicated  multi- 
processor  efficiently.  Experience  to  date  with  our  Piranha  system  for  adaptive 
parallelism  suggests  that  these  possibilities  can  be  achieved  in  practice  on  real 
applications  at  comparatively  modest  costs.  _ 

2)  Network  Piranha  and  Multiprocessor  (CM- 5)  Piranha 
Piranha  is  an  adaptive  version  of  master-worker  parallelism.  Programmers 
specify  in  effect  a  single  general  purpose  (but  application  specific)  worker  func¬ 
tion  called  piranha().  They  do  not  create  processes  and  their  applications  do 
not  rely  on  any  particular  number  of  active  processes.  When  a  proceswt  be¬ 
comes  available,  anew  process  executing  the  pitanha()  function  is  created  there; 
when  a  processor  withdraws,  a  special  retreat()  function  (which  is  also  provided 
by  the  application  programmer)  is  invoked,  and  then  the  local  piranha  process 
is  destroyed.  Thus,  there  are  no  “create  process”  operations  in  the  user  s  pro¬ 
gram,  and  the  number  of  participating  processes  (and  not  merely  processors) 
varies  dynamically. 
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The  Multiprocessor  (CM-5)  Piranha  research  explores  Piranha  on  the  Con¬ 
nection  Machine  DM-5  multiprocessor.  Our  preliminary  results  suggest  that 
adaptive  parallelism  provides  not  only  substantial  computing  power  for  parallel 
programs  but  also  an  attractive  alternative  to  traditional  methods  for  sharing 
multiprocessors  among  users. 

3)  “Lifestreams”;  A  Piranha  application  being  developed  from  re¬ 
search  under  AFOSR  Grant  #  F49620-92-J-0240 

Today,  just  managing  one’s  own  electronic  world  can  be  a  frustrating  task 
for  most  computer  users,  requiring  too  many  separate  applications,  too  many 
file  transfers  and  format  translations,  and  the  construction  of  organizational 
hierarchies  that  too  quickly  become  obsolete.  What  is  needed  is  a  metaphor 
and  system  for  organising  the  electronic  “bits  of  paper”  we  all  so  easily  collect, 
whether  they  come  to  us  in  the  form  of  electronic  mail,  downloaded  images, 
pages  gathered  from  the  Web,  or  scheduling  reminders. 

Lifestreams  is  such  a  system.  It  uses  a  simple  organizational  metaphor,  a 
time-ordered  stream  of  documents,  to  replace  conventional  files  and  directories. 
Stnam  filters  and  software  agents  ate  used  to  organize,  locate,  summarize  and 
monitor  incoming  information.  Lifestreams  subsumes  many  separate  desk¬ 
top  applications  to  accomplish  the  most  common  communication,  scheduling, 
and  search  and  retrieval  tasks;  yet  its  machine-independent,  client-server  ar¬ 
chitecture  is  open  so  that  users  can  continue  to  use  the  document  types  and 
viewers/editors  they  are  accustomed  to.  Currently  we  are  running  a  research 
prototype  of  the  system  in  house,  and  hope  eventually  to  make  the  software 
publicaily  available. 
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